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REGION TRACKING BY WARPING OF IMAGE LABELS 

Field of the Invention 

The present invention relates generally to the analysis of imaging systems and, in 
particular, to registering an image formed by the imaging system with an undistorted 
digital version thereof before performing image analysis. 

5 Background 

There is a general need for measuring the performance of an imaging system. 
The results from such performance measurement may be used for selecting between 
alternative implementations of the imaging systems. 

Until recently the measurement of the performance of imaging systems has 
10 primarily been mediated by human visual interpretation. For example, the performance 
of an imaging system may be measured by imaging a test chart containing a test pattern 
with the imaging system under test, and then comparing the properties of the test pattern 
appearing in the captured image with the known properties of the test pattern. . 

For instance, a process for determining the resolving power of a camera, which is 
15 a property of the performance of the camera, involves taking a photographic image of a 
standard resolution test chart and visually inspecting the image to extract from the image 
the resolving power of the camera. Similarly, the performance of a printing system may 
be measured by printing a known test pattern, and comparing the properties of the printed 
version of the test pattern with the properties of the test pattern. 

A common property of the above described processes used for measuring the 
performance of an imaging system is that a test pattern with known properties is visually 
analysed to characterize the properties of the imaging system. 

Recent advances in digital and electronic imaging have meant that automated 
measurement of the performance of an imaging system has become more common. For 
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example, during the evaluation of image compression processes, such as those using the 
JPEG and MPEG compression algorithms, a pixel-by-pixel comparison is made between 
an image before compression, and that after the compression process has been performed 
on the image. This form of measurement of the performance of an imaging system is 
5 simplified by the fact that the pixels of the images being compared are related, in that the 
compression process only changes pixel values, not pixel locations. In other words, no 
geometric distortion occurs during the compression process. 

The range of image quality parameters calculable in such an imaging system is 
immense because, in essence, the imaging system may be considered to be an imaging 
10 system where each pixel is an independent channel. In practice a small number of 
mathematical image quality parameters are calculated, such as mean squared error (MSB) 
and peak signal to noise ratio (PSNR), as well as human visual system (HVS) related 
parameters. In the area of measurement of performance from digital images and video 
images it is almost taken for granted that the original (uncompressed) image is known and 
15 available for comparison with the compressed image. 

However, in imaging situations other than pure compression, the output image 
cannot be directly compared to the input image because a geometrical transformation, or 
distortion, occurs in the imaging system. For example, when a digital camera captures an 
image of an ISO test pattern, the exact magnification, orientation and perspective 

» 

20 parameters are not known a priori, nor are those parameters easily controlled or fixed, 
except in laboratory controlled research environments. So, in such systems it is not, in 
general, possible to perform a direct comparison of input and output images because the 

, images are not congruent. 

It is often advantageous to compare corresponding regions of input and output 
25 images using higher level descriptors, such as texture, colour, spatial frequency. 
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However, if a distortion occurred between the input image and the output image as a 
result of the imaging system, regions can not be compared, as such regions are not 
guaranteed to be corresponding regions. 

■ 

Summary 

5 It is an object of the present invention to substantially overcome, or- at least 

ameliorate, one or more disadvantages of existing arrangements. 

According to an aspect of the present disclosure, there is provided a method of 
analysing images, said method comprising the steps of: 

receiving first and second images, said second image being a distorted version of 

10 said first image; 

labelling pixels of said first image with pixel labels; 

determining distortion parameters for aligning said first image with said second 
image; 

warping at least said pixel labels using said distortion parameters; and 
associating said pixel labels with corresponding pixels in said second image, 

it 

wherein said labels provide information on a state of pixels in said second image before 
distortion. 

According to another aspect of the present disclosure, there is provided an 
apparatus for implementing the aforementioned method. 

According to yet another aspect of the present disclosure there is provided a 
computer program product having recorded thereon a computer program for 
implementing the method described above. 

Other aspects of the invention are also disclosed. 
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Brief Description of the Drawings 

One or more embodiments of the present invention will now be described with 
reference to the drawings in which: 

Fig. 1 illustrates an arrangement for measuring the performance of a scanning 

device; 

Fig. 2 illustrates an arrangement for measuring the performance of a printer; 

Fig. 3 illustrates an arrangement for measuring the performance of the optical 
imaging system of a camera; 

Fig. 4 shows a schematic block diagram of the general-purpose computer; 

Fig. 5 shows an example one-dimensional scale invariant pattern; 

Fig. 6 shows a representation of an example of a one-dimensional scale invariant 
pattern extended in the transverse direction to cover a defined image area; 

Fig. 7 shows an example alignment pattern image, which is a superposition of 
four one-dimensional scale invariant patterns, each similar to the example one- 
dimensional scale invariant pattern shown in Fig. 6 but with different radius n f and angle 

a ( parameters; 

Fig. 8 shows the preferred configuration the axes of symmetry of four one- 
dimensional scale invariant pattern embedded in the alignment pattern shown in Fig. 7; 

Fig. 9 shows an example test pattern that consists of both a spread-spectrum 
alignment pattern and a high frequency noise pattern; 

Fig. 10 shows the intersection points of the axes of symmetry shown in Fig. 8; 

Fig. 11 shows a flow diagram of a method of generating a dyadic test pattern; 

Fig. 12A shows an example test pattern consisting of tiles having frequency 
responses and orientation covering a predetermined range; 
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Fig. 12B shows an example test pattern derived from the test pattern shown in 
Fig. 12A, but with the spatial locations of the tiles randomly permuted; 

Fig. 13 shows a schematic flow diagram of a method of registering two digital 
images, and then analysing the registered images in order to determine characteristics of 
5 an imaging system under test; 

Fig. 14 shows a schematic flow diagram of a preferred implementation of the 
coarse registration performed in the method of Fig. 13; 

Fig. 15 shows a schematic flow diagram of rotation, scale and translation 
registration performed during the coarse registration shown in Fig. 14 in more detail; 
10 Fig. 16 shows a schematic flow diagram of transforming an image into a 

complex image as performed in the rotation, scale and translation registration shown in 

Fig. 15 in more detail; 

Fig. 17 shows a schematic flow diagram of generating from a complex image an 
image that is substantially invariant to translations as performed in the rotation, scale and 
15 translation registration shown in Fig. 15 in more detail; 

Fig. 18 illustrates some characteristics of resampling an image having Cartesian 
coordinates to an image in the log-polar domain; 

Fig. 19 shows a schematic flow diagram of invariant pattern detection performed 
during the coarse registration shown in Fig. 14 in more detail; 
20 Fig. 20 shows a schematic flow diagram of the preferred method of resampling a 

Fourier Transform into a quasi-polar frequency space; 

Fig. 21 shows a schematic flow diagram of the steps for performing block based 

correlation; 

Fig. 22 illustrates two images and the positions of first tiles within those images 
25 when performing block based correlation; 
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Fig. 23 illustrates the conversion from a grey scale test pattern to a CMYK test 

pattern; 

Fig. 24 shows a schematic flow diagram of the steps for performing 
interpolation; 

5 Fig. 25 shows a schematic flow diagram of the steps for detecting positions of 

peaks in a correlation image; 

Fig. 26 illustrates the warping of pixel labels. 

Detailed Description including Best Mode 

Where reference is made in any one or more of the accompanying drawings to 
10 steps and/or features, which have the same reference numerals, those steps and/or features 
have for the purposes of this description the same function(s) or operation(s), unless the 

contrary intention appears. 

Some portions of the description which follows are explicitly or implicitly 
presented in terms of algorithms and symbolic representations of operations on data 

15 within a computer memory. These algorithmic descriptions and representations are the 
means used by those skilled in the data processing arts to most effectively convey the 
substance of their work to others skilled in the art. An algorithm is here, and generally, 
conceived to be a self-consistent sequence of steps leading to a desired result. The steps 
are those requiring physical manipulations of physical quantities. Usually, though not 

20 necessarily, these quantities take the form of electrical or magnetic signals capable of 
being stored, transferred, combined, compared, and otherwise manipulated. 

Unless specifically stated otherwise, and as apparent from the following, it will 
be appreciated that throughout the present specification, discussions utilizing terms such 
as "scanning", "calculating", "determining", "replacing", "generating" "initializing", 

25 "outputting", or the like, refer to the action and processes of a computer system, or similar 
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electronic device, that manipulates and transforms data represented as physical 
(electronic) quantities within the registers and memories of the computer system into 
other data similarly represented as physical quantities within the computer system 
memories or registers or other such information storage, transmission or display devices. 

When evaluating a general imaging system, an image is (re-)produced by the 
imaging system from a master image. The two images are than compared by calculating 
mathematical image quality parameters. In order for the mathematical image quality 
parameters to be calculated, it is necessary for the images to be registered or aligned. 
Hence, it is necessary to precisely define the geometrical transformation, or distortion, 
required to bring the images into congruence. The distortion itself is also an important 
quality measure of the imaging system. Also, with the distortion applied by the imaging 
system known, attempts may be made to correct images produced by the imaging system. 

Precise image registration is also a principal requirement of high quality colour 
printing processes where the different colour channels have to be aligned precisely. The 
5 mis-registration is usually of the simplest kind, that is to say a translation or shift in 
mathematical terminology. Conventionally registration marks, such as crosses, are 
printed on the printing medium and just outside the main image area in an attempt to 
achieve precise registration of the colour channels. The crosses are usually composed of 
two thin lines which clearly show any misalignment of the printed colour channels. 
10 An alternative approach is to include alignment patterns, not spatially localised 

patterns like crossed lines, but distributed patterns like spread spectrum noise or 
mathematical patterns with special spatial and spectral properties; into the test pattern. 
Such alignment patterns may be embedded at low levels, so that they are not necessarily 
visible to the human eye, yet are still detectable by mathematical procedures such as 
25 matched filtering or correlation. 
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Unlike the printing process where the distortion is a relative translation between 
channels, the distortion occurring in more general imaging systems, such as a camera, 
may also vary spatially. In fact, the most common types of distortion in cameras depend 
on the lens of the camera. Such distortions include barrel or pin-cushion distortion, and 
5 perspective distortion. If the outward radial distortion increases with the distance from 
the image centre the distortion is called pin-cushion distortion, whereas if the inward 
distortion increases with distance from the image centre then the distortion is called barrel 
distortion. 

Using traditional methods, it would be necessary to include registration marks 
10 throughout the image area to allow complete image registration. This is possible, and 
indeed some of the test patterns described below include registration marks throughout 
the test pattern, which is then imaged. However, it is not necessary to explicitly embed 
such registration marks into the test pattern if the test pattern itself has a structure which 
may be called an "intrinsically alignable structure". Mathematical analysis of alignment 
15 processes shows that test patterns in an image are intrinsically alignable if the test patterns 
have a wide Fourier spectral content. In particular, a wide Fourier spectral content 
ensures that correlation based registration gives sharp, low noise alignment. 

From the preceding discussion it is clear that the use of a suitable test pattern 
may allow both precise image alignment and image quality metric evaluation on a pixel- 

4 

20 to-pixel basis. One of the significant advantages of such an approach is that a correlation- 
based alignment process may be used, which is perfect for automation. 

Fig, 1 illustrates an arrangement 100 for measuring the performance of a 
scanning device, such as digital scanner 120. The digital scanner 120 may be any type of 
digital scanner, such as a flatbed scanner, a photocopier with scanner functionality, a 
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drum scanner or the like. The digital scanner 120 interfaces to a general purpose 
computer 200. 

Fig. 4 shows a schematic block diagram of the general purpose computer 200. 
The computer 200 is formed by a computer module 201, input devices such as a 
5 keyboard 202 and mouse 203, and a display device 214. The computer module 201 
typically includes at least one processor unit 205, and a memory unit 206. The 
module 201 also includes an number of input/output (I/O) interfaces including a video 

'4 

interface 207 that couples to the display device 214 and loudspeakers 217, an I/O 
interface 213 for the keyboard 202 and mouse 203, and an interface 208 for interfacing 

10 with devices external to the computer 200, such as the digital scanner 120 (Fig. 1). 

A storage device 209 is provided and typically includes a hard disk drive 210 
and a floppy disk drive 21 1. A CD-ROM drive 212 is typically provided as a non- volatile 
source of data. The components 205 to 213 of the computer module 201, typically 
communicate via an interconnected bus 204 and in a manner which results in a 

15 conventional mode of operation of the computer system 200 known to those in the 
relevant art. 

Referring to Figs, 1 and 4, the arrangement 100 is controlled so that the digital 
scanner 120 scans a calibrated test chart 110 containing a test pattern to form a digital 
image of the test pattern. An advantage of using test charts instead of images containing 
20 natural scenes when performing image quality assessment is that control is gained over 
the structure of the input image because, as is well known from video sequence analysis 
theory, image registration is generally difficult in image regions having little texture. By 
appropriate design of the pattern appearing on the test charts, adequate texture can be 
assured in image regions that require registration. 



671600.doc 



-10- 

The digital image of the test pattern is typically stored on the memory 206 of the 
general purpose computer 200. The general purpose computer 200 also holds in its 
memory 206 a digital representation of the test pattern appearing on the test chart 110 as a 
test pattern image. Both images, those being the image formed by the scanner 120 from 

5 the test chart and the test pattern image, are stored in the memory 206 as a raster array of 
pixel values of some fixed precision or floating point data type with a horizontal and 
vertical dimension. The images are generally colour images, though the registration 
process described below is generally applied to only a single channel derived from the 
other channels, such as the luminance of the image. 

10 The calibrated test chart 110 is produced beforehand from the test pattern 

4 

through a process such as printing, etching, or other means. To be useful in the analysis 
of the performance of an imaging system, the calibrated test chart 110 must be produced 
by a sufficiently accurate process such that the spatial errors in the calibrated test chart 
1 10 are much smaller than those expected from the imaging system. Etching is preferably 
15 used for measuring the performance of the scanner 120 where high precision is required, 
such as the measurement of the modulation transfer function (MTF) of the scanner 120, 
whereas printing is used for producing test charts 110 for use in the measurement of 
characteristics that require less precision. 

Li 

After obtaining a digital image of the test pattern on the test chart 110 using the 
20 scanner 120, the general purpose computer 200 is then controlled to register the digital 
image from the scanner 120 and the test pattern image, and to analyse the registered 
images in order to determine at least one characteristic of the scanner 120. The method 
used for registering and analysing the images is described in detail below. 

« 

Fig. 2 illustrates an arrangement 101 for measuring the performance of a printer 
25 130. The printer 130 may be any type of printer, such as a laser beam printer, an inkjet 
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printer, a dye sublimation printer or the like. In this arrangement 101 the printer 130, as 
well as a calibrated digital scanner 140, are interfaced to the general purpose computer 
200, described in detail with reference to Fig. 4, through I/O interface 208. 

In operation the printer 130 receives a test pattern from the computer 200 and 
5 prints a test chart 150 containing the test pattern. The test chart 150 is then scanned using 
the calibrated scanner 140 in order to form a digital image of the test pattern, which is 
typically stored on the memory 206 of the general purpose computer 200. 

The general-purpose computer 200 is then controlled to register the digital image 
from the calibrated scanner 140 and an image of the test pattern, and to analyse the 
10 registered images in order to determine at least one characteristic of the printer 130. The 
method used for registering and analysing the images is described in detail below. 

Fig. 3 illustrates an arrangement 102 for measuring the performance of the 
optical imaging system of a camera 160. In this arrangement 102 the camera 160 
interfaces with the general-purpose computer 200 through I/O interface 208. 



15 



The camera 160 is controlled to capture an image of a calibrated test chart 170 



containing a test pattern. The calibrated test chart is formed in the manner described with 
reference to Fig. 1. The image of the calibrated test chart is then transferred to the general 
purpose computer and stored in memory 206. The general-purpose computer 200 is then 
controlled to register the image from the camera 160 and an image of the test pattern 

20 appearing on the calibrated test chart, and to analyse the registered images in order to 
determine at least one characteristic of the optical imaging system of the camera 160. 
The method used for registering and analysing the images is described in detail below. 

In arrangement 102 it is possible to use a self-luminous device, such as a liquid 
crystal display (LCD) or cathode ray tube (CRT) display, instead of a more conventional 

25 reflective or transmissive test chart which require external illumination. In fact, the LCD 
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or CRT display may be the display device 214 shown in Fig. 4 and forming part of the 
general computer system 200. A major advantage of using an LCD as test chart .170 is 
that LCD's are fabricated by micro-lithographic techniques on flat glass, which ensures 
extremely good dimensional accuracy and stability, especially when compared to printed 

5 test charts. An advantage of using either of the LCD or CRT display as test chart 170 is 
that changing the test pattern is just a matter of changing the signal sent to the display, so 
that numerous different test patterns may be used. It is also possible to rapidly display a 
sequence of different test patterns. 

In an arrangement not illustrated the performance of the optical imaging system 

10 of an analogue (film) camera may be measured by capturing a photograph of the 
calibrated test chart 170, developing the photograph, then scanning the photograph using 
a calibrated scanner such as scanner 140 (Fig, 2) to form an image of the test pattern on 
the calibrated test chart, registering the image of the test pattern on the calibrated test 
chart and an image of the test pattern, and analysing the registered images in order to 

15 determine at least one characteristic of the optical imaging system of the analogue 

a 

camera. 

It can be seen that in each of the arrangements described with reference to Figs 1 
to 3 two images are formed, a first being a digital representation of a generated test 
pattern, and the second being a captured image containing the same test pattern. The 
20 precise format of the test pattern used in each instance depends on the characteristic of the 
imaging system that is to be measured. 

Before describing the steps of registering the images and analysing the registered 
images in order to determine characteristic of the imaging system under test, be it the 
scanner 120 (Fig. 1), the printer 130 (Fig. 2), the optical imaging system of the camera 
25 160 (Fig. 3), or the optical imaging system of the analogue camera (not illustrated), the 
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generation of a number of test patterns for use in the registration and analysis of the 

4 

images is first described. 

The methods of generating test patterns are preferably practiced using the 
general-purpose computer 200 shown in Fig. 4 wherein the processes of generating the 
5 test patterns are implemented as software, such as an application program executing 
within the computer 200. In particular, the steps of generating the test patterns are 
effected by instructions in the software that are carried out by the computer. Typically 
the application program is resident on the hard disk drive 210 and read and controlled in 
its execution by the processor 205. Intermediate storage of the program and the storage 
10 of the generated test pattern may be accomplished using the memory 206, possibly in 
concert with the hard disk drive 210. 

Each of the test patterns is generated by the general-purpose computer 200 as a 
digital image, which is defined on a raster grid of N pixels by M pixels in the horizontal 
and vertical directions respectively. Each pixel of the test patterns is generally a colour 
15 pixel in some colour space, such as a linear RGB or CMYK colour space. Pixel values 
are preferably integer values in the range of 0 to 255. The test pattern may be converted 
to a single channel image based on the luminance of the colour pixels. 

Two coordinate systems are defined for the test pattern images. The first is an 
offset coordinate system where the location of each pixel is measured relative to the upper 
20 left corner of the image. The second is a Cartesian coordinate system where the x -axis is 
along the horizontal dimension of the image, the y -axis is along the vertical dimension of 
the image, a unit displacement in the Cartesian coordinates represents a single pixel 
displacement on the image, and the origin of the Cartesian coordinates lies at pixel offset 

Note that where the Fourier Transform is used, it is assumed that the 
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origin of the Fourier Transform is at the origin of the Cartesian coordinates. This means 



that the DC value of the Fourier Transform is positioned at pixel offset 



v 



TV 
2 



M 
2 



- 1 / 



, and 



the Nyquist frequency (for images with even width and height) is positioned at pixel 
offset (0,0). The forward and inverse Fourier Transforms are normalised such that the 
5 inverse Fourier Transform is divided by a scaling factor of U(NxM) and the forward 
Fourier Transform has a scaling factor of unity (no scaling). 

Also, where interpolation is used throughout, it is assumed that half sample 
symmetric reflective boundary conditions are used to extend the image to allow 
interpolation of pixels at the edge of the image. 

* 

10 The first test pattern described here is a test pattern used in determining spatial 

inaccuracies in the imaging system under test. The test pattern consists of two 
superimposed patterns, those being an alignment pattern, and a pseudo-random noise 
pattern. 

The alignment pattern in turn is a superposition of four one-dimensional scale 
15 invariant patterns, with each one-dimensional scale invariant pattern extended in a 
transverse direction to cover a defined image area. A one-dimensional basis function 
from which each one-dimensional scale invariant pattern may formed is: 

f(x) = cos(y log|x -x 0 \) (1) 

where yis a constant that specifies how quickly the pattern oscillates and x 0 

20 specifies the symmetry point for the pattern. The reason the basis function is termed a 
"scale invariant" pattern is because the pattern, when correlated with a scaled version 
thereof, still forms a correlation peak. An example one-dimensional scale invariant 
pattern is represented in Fig. 5. Each one-dimensional scale invariant pattern Mx,y) that 
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m 

has been extended in the transverse direction is specified by two further parameters, 
namely a radius , and an angle a t , as follows: 

fi (*> y) = cos(y log|*cos a ( + y sin a t - r ( |) (2) 

■ wherein the angle a x is the angle an axis of symmetry of the scale invariant 
5 pattern fi(x,y) forms with the vertical Cartesian axis, and the radius ^ is the distance from 

the Cartesian origin to the axis of symmetry. Each one-dimensional scale invariant 
pattern fi(x 7 y) also has a Nyquist radius, which is the number of pixels from the axis of 
symmetry of the pattern where the frequency of the pattern is equal to the Nyquist 
frequency of the image. Pixel values within the Nyquist radius from the axis of symmetry 

10 are attenuated. An example of a one-dimensional scale invariant pattern fi(x,y) extended 
in the transverse direction to cover the image area is shown in Fig. 6, together with the 
parameters radius r. and angle a t , and the axis of symmetry. A representation of an 
example alignment pattern image, which is a superposition of four one-dimensional scale 
invariant patterns f£x,y\ each similar to the example one-dimensional scale invariant 

15 pattern shown in Fig. 6 but with different radius r* and angle a t parameters, is shown in 
Fig. 7. In the representation pixel values have been binarized. 

The preferred values of the radius r. and angle a ( parameters for the one- 
dimensional scale invariant patterns are: 

9 

16 
13 

r 2 =P d ,a 2 =—2n 

16 

3 (3) 
16 



P d 15 
r4 ~V2' a4 ~16 



671600.doc 



-16- 



with 



P d = max(256, min(W, M )) /(2 + V2) 



(4) 



and where the Nyquist radius is R NYQ = 50 . This set of parameters r, and has 

been specially chosen so that the axes of symmetry of the one-dimensional scale invariant 
5 patterns fi(x,y) intersect at points that define line segments that have certain ratios of 
lengths. The ratios of lengths are invariant under affine transformations. 

i 

From the preferred parameters above, the configuration of the axes of symmetry 
of the four one-dimensional scale invariant patterns fi(x,y) embedded in the alignment 
pattern is shown in Fig. 8. 



10 



For a test pattern with dimensions N pixels by M pixels, the following quantities 
are pre-calculated for each one-dimensional scale invariant pattern f&x y y) having 
parameters r;- and a, : 



D. = cos 



ft { 

— frac 
2 



1^ 
8 



4 



\ 



N_ 

2 

M 



+ r t cosa i 



+ r. sm a ( 



(5) 



-{X t cos a. + ]P;. sin a. )/ Z>. 

max(256,min(;V\M))/256 



The contribution of pattern ffay) to the pixel of the alignment pattern at offset 



15 (x,y) is P { (x, y) , and is given by: 
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R = R, + (y sin a + xcos a)l D l 



cos(pR WQ \ogf\R\ 



elseif(\R\ > R NYQ and|/?| <= R„ ) ( 6 ) 
3 (•*, y) = cosinR^Q log(ji?|)) 

f5(*.y)=o 



The pixel values of the alignment pattern are then calculated as follows: 



(7) 



The test pattern is generated by adding the raster array of real values representing 
the alignment pattern, that is P(x,y), to a pseudo-random noise pattern. The preferred 
generator of the pseudo-random noise pattern generates values in the range -1 to 1, and 

v 

the test pattern value at coordinate (x,y), denoted RGB(x, y) , may be obtained through: 

t(x, y) = (randorn(x, y, s) + 0.025P(x, y)) 
= max(*(jc, y)) 

'n*. = min(*(x, y)) ( 8 ) 
7 (x, y) = 256(f ( x, y) - ) /(f^ - ) 
i?GB(x, y) = fl/Cx, y)Jlr(*, y) JI/(*. y) J 

where random(x,y,s) represents the random value generated for the pixel at 
coordinate (x,y) and using seed value s. Hence an RGB pixel value is formed for each 
pixel in the test pattern, the RGB pixel value representing a grey level that is determined 
by adding 0.025 times the alignment pattern pixel value P{x, y) to the random number 
generated for that pixel, and then renormalising and quantising the entire image so that it 
takes on integer values between 0 and 255. 
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* 

The test pattern generated using the process described above is a grey scale test 
pattern that has both a spread-spectrum alignment pattern that may be used to determine 
approximate translation, rotation and scaling transformations of the captured test pattern 
efficiently, and a high frequency noise pattern, generated by the pseudo-random noise 

5 generator, that allows very fine spatial registration. A binarized representation of an 
example test pattern is shown in Fig. 9. 

Translation, rotation and scaling are known as affine transformations. A 
property of affine translations is that lines are transformed to lines. Accordingly, when 
the axes of symmetry of the patterns forming the alignment pattern undergo an affine 

10 transformation, then the transformed alignment pattern will still have axes of symmetry 
therein, with only the configurations of the axes of symmetry of the respective patterns 
transformed to different configurations. 

The pattern detection process described below uses this property of affine 
transformation by identifying the positions and angles of the axes of symmetry in the 

15 transformed alignment pattern, allowing the determination of parameters defining the 
affine transform that has been applied to the image containing the alignment pattern. 
Rather than analyse the parameters of the axes of symmetry directly, the method 
described below analyses the points of intersection of the axes of symmetry, which are 
shown in Fig. 10, and in particular the ratios of line lengths joining the points of 

20 intersection of the axes of symmetry, which are invariant to affine transformations. 

As noted, the test pattern generated using the process described above is a grey 
scale test pattern. In the case where different colour channels are to be aligned, such as 
the case where the CMYK colour space channels of a printer are to be aligned, it is 
necessary to use a test pattern containing pixels in the CMYK colour space. Accordingly, 

25 the grey scale test pattern described above has to be converted into a colour test pattern. 
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The conversion starts by the processor 205 dividing the grey scale test pattern 
into a set of 2 by 2 pixel cells. Fig. 23 illustrates the conversion from a grey scale test 
pattern 2300 to a CMYK test pattern 2310. In each of the cells the pixels are labelled as 
Gl, G2, G3, and G4 for the top left, top right, bottom left and bottom right pixels 
5 respectively. Item 2305 shows one such set of 2 by 2 pixel cells, and the labelling of 

pixels within the cell. 

Corresponding pixels in the test pattern containing pixels in the CMYK colour 
space are labelled as CI, M2, Y3, and KA respectively. Each of the pixels represents a 
channel in the CMYK colour space. Item 2315 shows the cell corresponding to cell 2305, 
10 and the labelling of pixels within the cell. The set of four pixels together form a colour in 
the CMYK colour space, with the CMYK colour written in the notation (C1,M2,F3,X4). 
Next the processor 205 attributes values to each of the pixels in the test pattern containing 
pixels in the CMYK colour space, with those values being derived from the values of the 
corresponding set of grayscale pixels. Values for the CI, M2, Y3, and KA pixels are 

ft 

15 attributed as follows: 

CI is given a cyan value of 255 if Gl is greater than 127, and given a value of 0, 
which corresponds to white, if Gl is less than or equal to 127; 

Ml is given a magenta value of 255 if G2 is greater than 127, and given a value 
of 0, which corresponds to white, if G2 is less than or equal to 127; 
20 73 is given a yellow value of 255 if G3 is greater than 127, and given a value of 

0, which corresponds to white, if G3 is less than or equal to 127; and 

KA is given a black value of 255 if G4 is greater than 127, and given a value of 0, 
which corresponds to white, if G4 is less than or equal to 127. 

The CMYK test pattern formed in this manner contains pixels having one of 5 
25 distinct pixel colours, those being white, cyan, magenta, yellow or black. 
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A second test pattern described here, termed a dyadic test pattern, is useful for 
measuring the accuracy of the colour measurements of the imaging system under test. 
The dyadic test pattern contains a number of patches. Each of the patches may have a 
known constant, flat colour. Alternately, the dyadic test pattern may contain patches of 

5 slowly varying colour, patches having a pattern with a specific frequency distribution 
and/or orientation to measure other aspects of the imaging system's performance, patches 
having pseudo-random noise, or combinations of the above. It is convenient for such 
patches to have different sizes within the dyadic test pattern to assist in measuring the 
imaging system's response to different sized input patches. 

10 Fig. 11 shows a flow diagram of a method 700 of generating a dyadic test 

pattern. The patches have different sizes and colour properties, with the properties of the 
patches predetermined dependent on the characteristic of the imaging system that is to be 
measured. 

Method 700 starts in step 705 where the processor 205 receives an initial patch 
15 list, and stores the initial patch list in the storage device 209. The initial patch list 
specifies how many similar regions on the test pattern are to be created. Each similar 
region in one implementation contains a single flat patch of colour and multiple smaller 
patches of colour. In another implementation each similar region contains a single patch 
with slowly varying colour and multiple smaller patches, each also with slowly varying 
20 colour. In yet another implementation each similar region contains a single patch with a 
pattern with a specific frequency distribution and orientation, and multiple smaller 
patches, each also containing a pattern with a specific frequency distribution and 
orientation. 

The initial patch list may specify that the dyadic test pattern is to be generated 
25 containing an 8x8 arrangement of 64 square regions or tiles, each of size 256 pixels by 
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256 pixels, where each region has a single large patch of colour and progressively more 
patches of colour at progressively smaller sizes. 

In step 710 the processor 205 determines whether the (initial) patch list is empty. 
If it is determined in* step 710 that the patch list is empty, method 700 proceeds to step 
5 750 where method 700 ends. 

Alternatively, if the processor 205 determines in step 710 that the patch list is not 
empty, then the method 700 proceeds to step 715 where the processor removes a first 
patch from the patch list. Step 720 follows where the processor 205 determines whether 
the patch removed from the patch list in step 715 has a width or height of only a single 

r- 

10 pixel. 

If it is determined in step 720 that a dimension of the patch under consideration 
is greater that a single pixel, then that patch is subdivided in step 730. In particular, the 
patch is subdivided into four smaller patches, with each smaller patch being a square 
covering a quarter of the area of the patch being divided. In the case where the patch 
15 being divided has a width or height that is an odd number, the patch is divided to form 
smaller patches that are as near as practicable to a quarter of the area of the patch being 
divided. In step 735 that follows the processor 205 selects one of the smaller patches in 
such a way so as to avoid any substantial periodicity in the generated test pattern. One 
method that may be employed by the processor 205 to avoid any substantial periodicity in 
20 the generated dyadic test pattern is to select the smaller patch from the four available 
smaller patches randomly. The unselected patches are added to the patch list in step 740. 
The (smaller) patch selected in step 735 is not subdivided further, and a property, such a 
colour, is assigned to that patch in step 745, with the property assigned to the patch being 
assigned according to the size and location of the patch under consideration within the 
25 dyadic test pattern. After step 745 the method 700 returns to step 710. 
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Also, if it is determined in step 720 that the patch under consideration does have 
a dimension that is only a single pixel, then that patch is not to be subdivided further, and 
a property is assigned to that patch in step 745 as is described above. 

It can be seen that the effect of method 700 is that, each time steps 7 10 to 745 are 
performed, a patch from the patch list is removed, divided into 4 smaller patches, one of 
the smaller patches is assigned a property while the other 3 patches are added to the patch 
list. This process continues until the patches can no longer be divided. 

An example of a manner in which the property of colour may be assigned to a 
patch in step 745 in the case of a dyadic test pattern with an 8x8 arrangement of 64 square 
regions is to assign each of the 64 large patches, resulting from the initial division of the 
regions, a different shade of grey, and to assign all the other (smaller) patches a random 
colour. 

Other multi-scale test patterns may be generated using a method similar to 
method 700 by decomposing other shapes, other than squares, into collections of smaller 
Shapes. For example, triangles may be used as the shapes, and selectively divided into 
smaller triangles through some pseudo random decision process. 

Multi-scale test patterns have the advantage that, not only do they provide 
patches or regions having the required properties, but they also provide for improved 
alignability due to the induced spatial variation. Closely connected with the improved 
alignability is an increase in the spatial frequency content of the test pattern. 

A third test pattern described here is a test pattern having a frequency response 
chosen to provide a good degree of contrast for the imaging system under test, while still 
containing a wide range of frequencies. As is known in the art of image analysis, an 
image is generally alignable if the image contains a pattern with a wide Fourier spectrum. 
For this reason pattern having a frequency spectrum that is flat is often chosen. 
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However, a flat frequency spectrum also has a number of disadvantages in some 
limited circumstances. One disadvantage results from the fact that a pattern having a flat 
frequency spectrum, even when printed in binary, has high frequency energy which is 
higher than that contained in natural patterns. Natural patterns tend to have a frequency 
spectrum that follows a V frequency curve. If the imaging system attenuates high 
frequencies to any extent, or if the imaging system captures or represents the test pattern 
at a resolution below that of the digital version of the test pattern, then much of the energy 
contained in the high frequencies of the test pattern is lost. This may result in an image of 
the test pattern with very poor contrast, which in turn may interfere with the test of the 
imaging system. For example, the focus mechanism for the camera 160 may not operate 
correctly with a test chart 110 containing a spectrally flat test pattern because the focus 
mechanism cannot detect any strong edges. 

An alternative to a test pattern having a flat frequency spectrum is a test pattern 
having a frequency response chosen to provide a good degree of contrast for the imaging 
system, while still containing a wide range of frequencies. An example of such a test 
pattern is a random fractal, which may be generated by a variety of means. One such 
means is to create random noise with a spectral distribution of the form: 



where the function random is a pseudo-random function operating on a seed s, 

* 

F" 1 is the inverse Fourier transform, r is the radial distance from the Fourier origin, and 
parameter 0 is chosen to have some real value, typically between 1 and 3. In the 
preferred implementation parameter p=l, which produces a pattern with fractal dimension 
3 and is highly textured. 
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The use of such a test pattern has the advantage that more energy is available in 
the lower frequencies, resulting in an image captured of the test pattern, such as when 
scanner 120 or camera 160 are used, or an image printed of the test pattern, such as when 
printer 130 is used, to having more contrast. Because the spectral scaling factor, r"^ , is 

5 scale-invariant, computation of the MTF is made easier than it would be where some non 
scale-invariant scaling factor is used. 

Generally, to allow accurate alignment, the test pattern should contain areas with 
a large amount of fine detail. The finer the detail, the more distinct each pixel is when 
compared with its neighbours. Regardless of any fine detail contained in the areas, if 

10 such areas are repetitive or have properties similar to neighbouring areas, then there is a 
probability that areas may be incorrectly registered with neighbouring areas instead, as 
such areas may become indistinguishable. It is therefore preferable for areas in the test 
pattern not to be repetitive. 

In view of the foregoing, a fourth test pattern described here is a test pattern 

15 wherein the spatial locations of elements within the test pattern are randomly permuted to 
improve registration. For example, consider a test pattern consisting of differently 
coloured tiles, with the tiles all having the same size. It may be required that the colours 

■ 

of the tiles cover a predetermined range of hues, saturations and brightness in a regular 
manner. According to the fourth test pattern, the colour patches formed are randomly 

20 permuted so that there is essentially no regularity to the way that the colours of the 
patches change. Tile boundaries within this fourth test pattern are now between tiles with 
colours that are substantially different, producing improved structures for facilitating 
improved image registration. 

It is noted that the characteristics of the elements are not limited to colour. For 

25 example, consider a test pattern consisting of tiles having frequency responses and 
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orientation covering a predetermined range, such as the test pattern shown in Fig. 12A. 
As can be seen, neighbouring tiles have very similar frequency responses and orientation, 
which may adversely effect registration. Fig. 12B shows a test pattern wherein the spatial 
locations of the tiles from the test pattern shown in Fig. 12A are random permuted to 

5 improve registration. 

It is often desirable to measure multiple characteristics of an imaging system 
without the need to produce multiple test patterns, to form test charts from the test 
patterns where applicable, to capture an image of each of the test patterns, and then to 
analyse the resulting images. To that end it is often advantageous to combine elements 

10 from different types of test patterns onto a single test pattern. It should be apparent that 
different types of test patterns may be combined onto a single test pattern by placing two 

or more test patterns on a single test chart, or by combining elements of these test patterns 

> 

in other ways. A person skilled in the art would understand that some combinations of 
test patterns are not permissible. For example, flat colour patches of a dyadic test pattern 
15 for use in colour measurement can not be combined with other elements without affecting 
the constant colour. However, such restrictions can be localized to relatively small parts 
of the test pattern. 

It is also often desirable to surround each test chart with a region filled with a 
colour noise pattern. This allows accurate alignment right to the edge of the test pattern. 

20 Additional information regarding the nature of each pixel, such as a label 

identifying to which patch in the test pattern the pixel belongs in the case of dyadic test 
patterns, or the texture, colour or spatial frequency of the region in the test pattern to 
which the pixel belongs, may be stored as metadata to the file containing the test pattern 
image when stored on the memory 206. Such information may be used for high level 

25 comparison of regions within a test pattern image and an image captured using the 

■ 
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imagmg 



system under test containing the test pattern, and may be a label or a high level 



descriptor. 

For high level comparison to work, the correspondence of regions need to be 
found with pixel or sub-pixel accuracy. This is requires so that it is clear exactly which 
pixels form the region denoted by the higher level descriptor or label. 

With a number of useful test patterns described above, as well as the manners in 
which a digital image of the test pattern is formed for each of the arrangements 100, 101 
and 102 shown in Figs. 1 to 3, Fig. 13 shows a schematic flow diagram of a method 1000 
of registering two digital images, and then analysing the registered images in order to 
determine characteristics of the imaging system under test. In particular, the two images 
registered by method 1000 are a. test pattern image 1005, which is a digital representation 
of one of the test patterns described above (or a combination of the test patterns), and an 
image 1010 formed by the imaging system under test containing the test pattern and in the 
manner described with a reference to Figs. 1 to 4. The dimensions of these images are not 
necessarily equal. 

Method 1000 starts in step 1015 where a coarse registration of images 1005 and 

■ 

1010 is performed. In the simplest form the coarse registration is achieved by -mechanical 
means prior to capturing image 1010, for example by using a guide template (not 
illustrated) when placing the test chart 1 10 onto the scanner 120 (Fig. 1). 

In a preferred implementation the coarse registration of step 1015 is performed 
by the processor 205 and in a manner described in more detail below with reference to 
Fig. 14. The output of the preferred coarse registration performed in step 1015 is 
registration parameters 1020, which are a set of linear transformation parameters 

■ 

(a n ,a 12 ,a 21 ,a 22 ,x 0 ,y 0 ). The set of linear transformation parameters 
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(a„,a 12 ,a 21 ,a 22 ,x 0 ,y 0 ) relates the pixel coordinates (x,y) (in the Cartesian coordinate 
system) to transformed coordinates (x, y) through: 



x^ 




a n a i2^ x 
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fx ^ 


(10) 
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When a transformation defined by the registration parameters 1020 is applied to 
5 the test pattern image 1005, then images 1005 and 1010 should be coarsely aligned, 

generally to within a few pixels. 

Accordingly, after step 1015 method 1000 proceeds to step 1025 where the 
processor 205 uses the set of registration parameters 1020, that is parameters 
(a u ,a I2 ,a 21 ,a 22 ,^ 0 ,y 0 ), to transform the test pattern image 1005 to thereby form a 
10 coarsely registered test pattern image 1030 which is coarsely registered with the image 
1010 formed by the imaging system under test. In particular, the value at coordinate 
(x,y) in the coarsely registered test pattern image 1030 has the luminance value of the 
pixel at coordinate (x, y) in the test pattern image, where coordinate (x, y) is determined 
by an inverse of the linear transformation represented by the registration parameters 1020 



15 as follows: 



20 
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For coordinates (x,y) that do not correspond to pixel positions, bi-cubic 
interpolation is used to calculate the luminance value for that position from neighbouring 



values. 



Images 1010 and 1030 are then input into step 1035 where the processor 205 
performs block-based correlation and in a manner described in more detail below with 
reference to Fig. 21. The block-based correlation of step 1035 divides the images 1030 
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and 1010 into smaller, possibly overlapping blocks, and generates a displacement map 
1037 that represents the displacement that is required to be performed on each block of 
the coarsely registered test pattern image 1030 in order to map the pixels of the blocks of 
the coarsely registered test pattern image 1030 with that of the image 1010. The 
5 displacement map 1037 formed by the block-based correlation of step 1035 is then 
interpolated in step 1040, using bi-cubic interpolation, to form a distortion map 1045. 
The distortion map 1045 represents the distortion to sub-pixel accuracy that maps each 
pixel of the coarsely registered test pattern image 1030 to the corresponding pixel in the 
image 1010 formed by the imaging device under test. 
10 The distortion map 1045 together with the registration parameters 1020 resulting 

from the coarse registration performed in step 1015 are then used by the processor 205 in 
step 1050 to warp the test pattern image 1005 to form a registered test pattern image 
1055. The warping performed in step 1050 starts by modifying the distortion map 1045, 
which represents the distortion that maps the pixels of the coarsely registered test pattern 
15 image 1030 to pixels of image 1010, to a distortion that maps the pixels of the test pattern 
image 1005 to pixels of image 1010. This is done by adding the linear transformation 
determined in the coarse registration step 1015 to the distortion map 1045 as follows: 



20 total warping performed in step 1050. 

* 

Next a new image having the same size as that of image 1010 is formed, with all 
pixel values set to null, This new image, when populated with pixel values, will be the 
registered test pattern image 1055. For each pixel in the new image a pixel value is 
calculated by first determining the warping applicable to that pixel position. The 




(12) 



wherein D\iJ) represents the distortion map 1045 and D"(iJ) represents the 
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processor 205 then calculates the pixel value by determining the pixel value in image 
1005 that corresponds with that warped position. As it is likely that the warped position 
will not correspond with a pixel position in image 1005, interpolation is used for 

« 

calculating the pixel value, which is then stored in memory 206. 
5 The registered test pattern image 1055 typically includes several pixel 

components, for example red, green, and blue intensity components, as well as a label or 
higher level descriptor. Accordingly, all pixel components including the label or higher 
level descriptor are warped in step 1050 to form the registered test pattern image 1055. 
By warping the label a direct comparison of image regions can be made by mapping the 
10 labels of the registered test pattern image 1055 onto the imaged test pattern 1010. 

When performing the interpolation to calculate the pixel values of the registered 
test pattern image 1055, different interpolation methods may be used to interpolate each 
pixel component. For example, the red, green and blue components may be calculated 
using bi-cubic interpolation, whereas the integer label may be calculated using nearest- 
15 neighbour interpolation. The label channel, which is typically formed by integers, is 
interpolated using nearest neighbour interpolation to ensure only integers result in the 
output. Other interpolations techniques would tend to average adjacent labels resulting in 
non-integer labels, or labels with integers that did not occur previously. The nearest 
neighbour interpolation of the labels results in a warped label map with labels accurate to 

20 the nearest half pixel. 

Fig. 26 illustrates the warping of pixel labels. A test pattern image 2600 is 
illustrated having 3 coloured/textured regions 2601, 2602 and 2403 on a white 
background. A label map 2610 of test pattern image 2600 is also illustrated, with the 
labels of the pixels in the label map 2610 indicating the region 2601, 2602 or 2403 to 

25 which the pixel belongs. In particular, the labels of the pixels in the pixel map 2610 have 
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10 



integer values "0", "1", "2" or "3" dependent on whether the pixel is part of the white 
background, or regions 2601, 2602 or 2603. 

An image 2620 captured of the test pattern image 2600 is further illustarted. As 
can be seen, the image 2620 is a greatly distorted version of the test pattern image 2600. 
During step 1050 described above where a registered test pattern image (not illustrated) is 
formed by warping the test pattern image 2600, a warped label map 2620 is formed. The 
labels of the pixels in the warped label map 2630 indicate the region 2601, 2602 or 2403 

to which the pixel belongs. 

The warped label map 2630 may be directly superimposed onto image 2620, and 
used as a label layer of image 2620. The process may also be viewed as the 
reintroduction of image metadata (on a pixel-by-pixel basis) after an imaging process 
(such as printing-scanning) that lost the metadata linkage. 

Once the label layer is inserted into the image 2620, all manner of higher level 



descriptors can be defined and calculated. For example, it might be that a label "3" 
15 denotes regions 2603 with certain textural properties, as is illustrated in Fig. 26. By 
computing the ensemble pixel value statistics of all pixels labelled "3" within image 
2620, the global textural properties of region 2623 are known. 

The registered test pattern image 1055 is precisely aligned to the image 1010 
formed by imaging system under test. The final step of method 1000 is step 1060 where 
20 the processor 205 uses one or more of the images 1010 and 1055, the distortion map 
1045, and the registration parameters 1020 to analyse the registered images 1010 and 
1055 in order to determine characteristic of the imaging system under test. 

For example, the distortion map 1045 represents the "fine" part of the mapping 
of pixels- in the imaged test pattern 1010 to pixels in the test pattern image 1005. 
25 Accordingly, in the case of scanner 120 (Fig. 1), the distortion map 1045 represents 
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inaccuracies in the scanning process, such as those that may be caused by non-constant 
drive speed, an aspect ratio error, or some other physical errors in the scanner's scanning 
mechanism. The distortion map 1045 also represents the distortions introduced by the 
lens of camera 160, and inaccuracies in the drive mechanism of the printer 130. 
5 Another form of the analysis of the quality of an imaging device is the 

measurement of the modulation transform function (MTF) of that device. The MTF may 
be measured using any test pattern with a suitable spread-spectrum. Examples of 
appropriate patterns include a two-dimensional M-sequence, greyscale pseudo-random 
noise, binarized pseudo-random noise, and two-dimensional perfect binary arrays (which 
10 have perfect autocorrelations for certain array sizes; perfect being defined as an 
autocorrelation with no sidelobes). Any pattern having sufficient detail may be used for 
accurate alignment and may be incorporated in the test pattern. 

The MTF is calculated by the processor 205 by dividing each pixel of the 
modulus of the Fourier transform of the imaged test pattern 1010 by each pixel of the 
15 modulus of the Fourier transform of the registered test pattern image 1055, thereby 
producing an image of the two-dimensional MTF of the scanner 120. It is possible to 
localize and repeat the MTF measurement at various locations around the image area, so 
that spatial variations of the MTF may be detected. 

With accurate alignment it is also possible to estimate the full (complex) optical 
20 transfer function (OTF), not just its modulus, which is the MTF. The full OTF is 
calculated by the processor 205 by taking the Fourier transform of the system point 
spread function (PSF). The advantage of calculating the OTF is that the system PSF may 
be calculated directly, whereas the more common MTF measurement does not allow PSF 
estimation. The various transfer functions are only applicable to linear systems, which in 
25 practice means that these calculations need to be carried out in the correct colour space, 
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with particular care being taken to use the correct contrast model or gamma. The reason 
alignment is important when performing the OTF measurement is that any distortions in 
the imaged test pattern 1010 will result in phase errors in the Fourier Transform of the 
imaged test pattern 1010, which in turn will affect the OTF measurement. 

Yet further, analysis and calibration of the colour response of the scanner 120 
may be implemented using a colour test pattern containing a distribution of suitably sized 
colour patches, such as the dyadic test pattern described above. Typically the colour 
calibrated test chart 1 10, which is scanned by the scanner 120 under test, is fabricated on 
a printing device, with the colour patches calibrated with respect to a known colour 
standard using a device such as a scanning spectrophometer. Because the pixels in the 
registered test pattern image 1055 are aligned with those in the imaged test pattern 1010 
in the manner described above, the processor 205 has knowledge to which patch each 
pixel belongs. The. processor 205 then combines the colour of the pixels of different 
patches of the imaged test pattern 1010 by averaging in a suitable colour space, and then 
compares the average colour of each patch with the known spectrophometer value. A 
colour profile may then be generated using maximum likelihood or least-squares 

methods. 

Another form of analysis is termed granularity, and is determined when the 
imaging system under test is the printer 130 (Fig. 2). Granularity attempts to measure 
how smooth a patch of solid colour printed on the printer 130 appears to a human viewing 
the patch at a nominal viewing distance. For this analysis, test chart 150 contains the 
dyadic test pattern. The granularity of a patch is calculated by the processor 205 from the 
standard deviation of the luminance values of the patch after it has been filtered by a 
visual transfer function (VTF) that roughly mimics- the effect of the human visual system 
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when viewing a piece of paper at a distance of 30cm. The VTF used is described in 



frequency space by: 



V(f X ,fy) = \ 



\5.05(e^ m * 5f Xl - e^ 5f ):f>l (13) 
1 :/<l 



where f 2 =f? + f? and/ is measured in cycles/mm on the scanned page. 

5 The filtering is performed by selecting a region of the image, R, twice the size of 

the patch and extending around the patch for which the granularity is to be measured. 
The luminance of this region R is then filtered using: 

R'=3- { {3{R)-V(f x J y )\ (14) 

where 3 and 3" 1 are the two-dimensional FFT and inverse FFT respectively. 
10 The granularity is then measured by the processor 205 by taking a square region 

of n by n pixels within the patch that covers a significant fraction of the original patch but 
avoids its borders, and measuring the mean and standard deviation of the luminance 
values within this square region of n by n pixels through: 

R=±±R\j (15) 

15 and 



where G is the granularity of the patch, R' u $ is the luminance value of a pixel in 

the region, and ^is the mean of the luminance values of the pixels in the region. The 
final measure G corresponds to how grainy a nominally flat colour appears to a human 
20 observer at a specific (viewing) distance from the printed test chart 150. The granularity 
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may also be thought of as a measure of the deviation of a colour patch from its mean 
level. 

When evaluating a colour printer, such as a CMYK printer, it is desirable to also 
measure the alignment of different colour channels. For example, the C channel of an 

5 image printed in the. CMYK colour space may be several pixels offset from other 
channels due to some mechanical inaccuracy in the printer 130. This misregistration 
leads to noticeable visual defects in the printer's output, namely visible lines of white 
between objects of different colour that should not be present. Detecting and preventing 
such errors is an important problem to be solved in the design and manufacture of colour 

10 printers. 

For this analysis the colour test pattern consisting of the alignment pattern 
superimposed with the pseudo-random noise pattern is used. Also, during performance of 
the block based correlation in step 1035 (Fig. 13), correlation is performed between the K 
channel of the coarsely registered test pattern image 1030 and the K component of the 

15 imaged test pattern 1010, with the K component of the imaged test pattern 1010 
calculated from the RGB values as K = Min(255 - R, 255 - G, 255- B). This produces a 
registered test pattern image 1055 in which the black pixels thereof are aligned precisely 
with the black pixels of the imaged test pattern 1010. Due to the possibility that the 
colour channels are mis-registered in the printing process, the C, M and Y channels of the 

20 registered test pattern image 1055 may not be precisely aligned with the imaged test 
pattern 1010. 

The block based correlation of step 1035 is then performed between each of the 
C, M and Y channels of the registered test pattern image 1055 and those of the imaged 
test pattern 1010 in order to produce a distortion map for the C, M and Y channels, each 
25 of which being similar in -form to the K channel distortion map 1045. Each of the C, M 
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and Y channel distortion maps shows how the misregistration of the corresponding colour 
channels with respect of the K channel varies across the printed page. These distortion 
maps, or information derived therefrom, may be supplied to a field engineer to allow 
physical correction of the misregistration problems, or alternately, they may be input to 
5 the printer 130 for use in a correction circuit that digitally corrects for the printer colour 
channel misregistration. Printer colour channel misregistration is typically caused by the 
paper rollers thereof not being exactly centred circular cylinders. 

When measuring the performance of the optical imaging system of camera 160 
Fig. 3, it is often useful to know the level of distortion present in a camera image. 
10 Typically the distortion may be pincushion or barrel distortion. The distortion typically 
depends on the lens used by the camera 160. In the case of a zoom lens the distortion will 
vary with the focal length selected. The distortion map 1045 shows the residual 
distortions that remain after perspective distortions have been removed. The residual 
distortion is further decomposed into radial components (purely related to the distance 
15 from the centre of the image) and other components. The other components are typically 
expected to be negligible in a camera with a symmetrical lens, although precision 
registration allows the measurement of any deviation from the ideal. The common lens 
distortions are related to the cube power of the radial distance from the image centre. If 
the outward radial distortion increases with the distance from the image centre the 
20 distortion is called pincushion distortion, if the inward distortion increases with distance 
then it is called barrel distortion. Calculating a third order least-squares fit to the residual 
distortion map 1045 determines precisely the pincushion or barrel distortion. 

As noted above, in arrangement 102 it is possible to use an LCD instead of a 



more 



conventional reflective or transmissive test chart. The chromatic aberration of the 
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optical imaging system of the camera 160 is estimated by the processor 205 by comparing 
separate distortion maps for each of the R, G and B channels in the RGB colour space. 

Care needs to be taken to avoid artefacts of the test chart interfering in the 
measurement. A test chart based upon an LCD structure inherently has an RGB channel 
displacement due to the displaced R, G, and B picture elements of the display. This 
displacement is known and fixed, and so can be eliminated from the final chromatic 
aberration estimate. The difference in the R, G, and B distortion maps minus the LCD 
structural RGB channel displacement directly represents the total camera chromatic 
aberration. The chromatic aberration may be further analysed to separate the optical 
chromatic aberration (from the lens) and any chromatic effects related to the Bayer colour 

filtering on the image sensor. 

The chromatic effects described above are usually referred to as lateral chromatic 
aberration. Another type known as axial chromatic aberration does not produce the 
distortion effects above, but instead introduces a change in the power spectral function 
shape, and in particular power spectral function width, related to the wavelength of light. 
Axial chromatic aberration estimation is facilitated by repeating the aforementioned 
spatially varying power spectral function measurements, but for three separated channels, 
R,G, and B. The difference in the power spectral function widths of the separate channels 
characterises the axial chromatic aberration. 

The block based correlation performed by the processor 205 in step 1035 is now 
described in more detail with reference to Fig. 21 in which a schematic flow diagram of 

< 

the steps for performing the block based correlation is shown. Step 1035 operates on two 
images, those being the imaged test pattern 1010 and the coarsely registered test pattern 
image 1030 resulting from step 1025. The size of images 1010 and 1030 are N2 by Ml 
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pixels and Nl by Ml pixels respectively. The coarsely registered test pattern image 1030 
is padded appropriately to be the same size as the imaged test pattern 1010. 

During block based correlation, each of the two images are divided in to smaller 
tiles of dimension Q by Q pixels, with the positions of the tiles corresponding. Fig. 22 
5 illustrates images 1010 and 1030 and the positions of their first tiles 2201 and 2202. 
Correlation is then performed on the tiles 2201 and 2202 to determine the translation that 
best relates the two tiles. A next pair of tiles is then formed from images 1010 and 1030 
by "stepping" through the images 1010 and 1030 by a step size 5. Correlation is then 
repeated between the newly formed tiles. These steps are repeated until all tiles formed 
10 from images 1010 and 1030 by stepping have been processed. In the preferred 
implementation the parameters are Q = 256 and S = 32. 

The output of the block based correlation step is displacement map 1037 that 
represents the warp that is required to map the pixels of the coarsely registered test pattern 
1030 to the imaged test pattern 1010, as well as a confidence measure for each 
15 displacement. The displacement map is a raster image of dimension 
D X =1(N1 + Q-1)/S] by D y =\_(M\ + Q-l)l S \, of displacement vectors and 

confidence estimates. 

Referring again to Fig. 21, step 1035 starts in sub-step 2030 where 
corresponding tiles are formed from images 1010 and 1030. The tiles have to contain 
20 pixel values from images 1010 and 1030 only, hence lie within those images 1010 and 
1030. For pixel (ij) in the displacement map 1037, the tile from image 1030, and the tile 
from image 1010 have the following coordinates identifying their respective positions in 
images 1030 and 1010: 

tilel: iiVl/2>(i-L^ (17) 
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tile 2: (LiV2/2j+(i-L^/2>-b2/2jLM2/2j+0-LD,/2j)S-|j2/2j. (18) 

Processor 205 next in sub-step 2050 applies a window function, such as a 
Hanning window, to each of the tiles, and the two windowed tiles are then phase 

correlated in sub-step 2060. 

The result of the phase correlation in sub-step 2060 is a raster array of real 
values. In sub-step 2070 the processor 205 then determines the location of a highest peak 
within the raster array, with the location being relative to the centre of the tile. The 
location of the peak is then stored by the processor 205 in sub-step 2080 into memory 206 
in the displacement map 1037 at position (ij), along with the square root of the height of 
the peak as a confidence estimate. If it is determined in sub-step 2085 that more tiles 
exist, then step 1035 returns to sub-step 2030 where a next pair of tiles is formed. 
Alternatively step 1035 ends in sub-step 2090. 

The interpolation performed in step 1040 (Fig. 13) is now described in more 
detail with reference to Fig. 24 in which a schematic flow diagram of the steps for 

f 

performing the interpolation is shown. The interpolation of step 1040 forms the distortion 
map 1045 from the displacement map 1037. Some values in the distortion map 1045 may 
map pixels in the coarsely registered test pattern image 1030 to pixels outside the 
boundary of the imaged test pattern 1010. This is because the imaging device may not 
have imaged the entire test pattern. 

Step 1040 starts in sub-step 1920 where the processor 205 determines the set of 

linear transform parameters, (b u ,b l2 ,b 21 ,b 22 ,Ax ,Ay), that best relates the displacement 

map 1037. 

The (undistorted) points in the imaged test pattern 1010 are labelled (^,y l7 ) for 
pixel (ij) in the displacement map 1037, and are given by: 
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U- . y,j )= 1^2/ 2>(/ - / 2 J)S ,LM 2/ 2 J+ (j - [D, / 2 > ) 



(19) 



These points are displaced by the displacement map 1037 to give the 

displaced coordinates, fo.yj. given by 

(vJiHW/). (20) 
where D(i, ;') is the displacement vector part of the displacement map 1037. 
The linear transformation parameters, acting on the undistorted points give affine 
transformed points, fa, y t j), given by 



X- 

y 






M 






\ J i 




^12 b 22j 






Ay j 



(21) 



The best fitting affine transformation is determined by minimising the error 
between the displaced coordinates, and the affine transformed points %,y v ) by 

changing the affine transform parameters ( b n , b l2 , b 2l ,b 22 ,Ax,Ay ). The error functional 
to be minimised is the Euclidean norm measure E: 



E = f,{x n -xJ + {y n -yJ 

n=l 



~ \2 



(22) 



The minimising solution is 



011 } 








=M _1 




Ax 







(23) 
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(24) 



with 
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(25) 



M _l = 



M 



$x S xy "^xx^y 
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S x S xy 



Sxx$y 



(26) 



and 



10 



|M| = detM = -S S^S^ + 25,5^5, - S„S y S, " 5 + 55 A 



(27) 



where the sums are carried out over all displacement pixels with non-zero 
confidence estimates on the displacement vectors in the displacement map 1037. 

The interpolation step 1040 continues to sub-step 1930 where the best fitting 
linear transformation is removed from the displacement map 1037. Each displacement 
map pixel is replaced according to: 



D(iJ)-*D{iJ)- 



X ij 



b \\ hi 



(28) 



The displacement map 1037 with the best fitting linear transform removed is 
then interpolated using bi-cubic interpolation in sub-step 1940 to a displacement map of 
dimension D X P by D y P. 

* 

A complication in the interpolation step 1940 is what to do if the displacement 
15 map has a pixel with zero confidence in the neighbourhood of the bi-cubic interpolation 
kernel. If this occurs, the pixel with zero confidence is itself substituted by an estimated 
value using an average of neighbouring pixels weighted by their confidence value. If no 
neighbouring pixels have positive confidence, a region-growing algorithm is used to 
determine the pixel value. The interpolated displacement pixel may now be computed 
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using bicubic interpolation using the pixels with positive confidence along with the 
substituted pixels in the displacement map. Finally, in step 1950, the interpolated 



displacement map has the removed best fit linear distortion reapplied according to: 



\ 



D'(i, j) ->D'(i, j)+ 



b n b 22 [y,j) {/sy) 



(29) 



5 where in this case 

fe.v,)=i^2/2j+(//5-L^/2M^ 2/2 >0 /5 -L^/2js). (30) 

The map D\iJ) forms the output of the interpolation step 1040, which is the 

distortion map 1045. 

Referring again to Fig. 13, the preferred coarse registration performed in step 
10 1015 is now described in more detail with reference to Fig. 14. As was described above, 
numerous different test patterns may be used. In sub-step 1110, the processor 205 
determines whether the test pattern includes an alignment pattern, that is whether the test 
pattern includes the first test pattern described with reference to Figs. 5 to 9. In the 
simplest implementation this may be determined from an input received from an operator 
15 of the computer 200. The input also includes the parameters of the test pattern. In an 
alternative implementation a search for an alignment pattern may be performed by the 
processor 205 on the test pattern image. 

If it is determined that the test pattern does not include an alignment pattern, then 
step 1015 continues to sub-step 1120 where rotation, scale and translation (RST) 
20 registration is performed on the luminance channels of the images 1005 and 1010 
respectively. In the alternative, if it is determined in sub-step 1110 that the test pattern 
does include an alignment pattern, then step 1015 continues to sub-step 1130 where 
invariant pattern detection is performed. Sub-step 1130 is also performed on the 
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luminance channels of the images 1005 and 1010 respectively. Each of the RST 
registration performed in sub-step 1120 and the invariant pattern detection performed in 
sub-step 1130 is described in more detail below. The output of each of the RST 
registration in sub-step 1120 and the invariant pattern detection in sub-step 1130 is 

registration parameters. 

When registering an imaged test pattern image 1010 formed using arrangement 
102 which includes camera 160 and shown in Fig. 3, perspective distortion has to be 
estimated and compensated for. Accordingly, for arrangements including camera 160 
sub-steps 1140 to 1170 described below are performed in order to estimate registration 
parameters 1020 whereas, for the arrangements not including a camera, such as 
arrangements 100 (Fig. 1) and 101 (Fig. 2), the output of either of sub-steps 1120 or 1130 

is the registration parameters 1020. 

Accordingly, for arrangements including camera 160 the output of either of sub- 
steps 1120 or 1130 is initial registration parameters 1140. In sub-step 1150 the initial 
registration parameters 1 140 are used to transform the test pattern image 1005 in a 
manner similar to step 1025 (Fig. 13) to thereby form an initially registered test pattern 
image 1160 which is coarsely registered with the image 1010 formed by the camera 160. 
The processor 205 then performs a block based correlation between the initial registered 
test, pattern image 1160 and the imaged test pattern 1010 in a manner similar to that 
described for step 1035 (Fig. 13). The output of the block based correlation step 1170 is 
a displacement map, which is a N x by N y image with three components, where the first 
two components of the pixel at location (j,k) represent the displacement (a ( #,A ( #) of 
the pixel at {x Jk , y Jk ) in the test pattern image 1005, to its position in the imaged test 
pattern 1010, and the third component represents a confidence estimate F Jk for which a 
non-zero value indicates that the block correlation step 1170 was successful in 
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determining a displacement for the block centred on that pixel. This displacement map 
may be used with a standard least squares minimisation algorithm such as the Levenberg- 
Marquardt algorithm to find the values of the perspective transformation parameters, 
(b n , b l2 , b l3 , b 2i , b 22 , b 2i , & 3I ; b 32 ), that minimise the following error functional 

I 

wherein 

^_ b u x + b l2 y + b n ^ 

b 3l x + b n y + l ' (32) 
b^x + bny + fe 23 

* 

The result of this minimisation is a set of perspective transform parameters 

« 

(b u ,b n ,b^K ,b 22 ,b 23 ,b 31 ,b i2 ) that represents the best fitting perspective transformation 
10 to the displacement map calculated by the block correlation step 1170. To improve the 
convergence of the minimisation, the initial values for the perspective parameters 
(b n A 2 A^ 2l ,b 22 ,b 23 ,b 3x ,b %2 ) are set from the values of the initial registration 

parameters 1 140 through: 

b n = a n 
b l2 = a n 

b->i = 

(33) 

22 " "22 

^23 = yo 

b 32 = 0. 

Fig. 15 shows a flow diagram of the RST registration of sub-step 1120 in more 
detail. Sub-step 1120 starts in sub-step i205 where the luminance channels of the test 



'21 - M 21 

15 b 22 = a 
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pattern image 1005 and the image 1010 formed by the imaging system under test are 
transformed into complex images 1206 and 1207 respectively. The complex images 1206 
and 1207 have both real and imaginary parts. 

Fig. 16 shows a flow diagram of step 1205 of transforming an image 1305 into a 
plex image 1360 in more detail. Image 1305 may be a test pattern image 1005 or the 
image 1010 formed by the imaging system under test, both of which are real images 
where the pixels are represented by real numbers. The complex image 1360 formed by 
step 1205 is one where the image pixels are represented by complex numbers. 

Step 1205 operates by encoding directional features of the image 1305 in such a 
way that, when a translation invariant is calculated, images rotated by 180° can be 
distinguished. 

Step 1205 starts in sub-step 1310 where the image 1305, which is an N by M 
aiTay of real numbers, is resized by a process of successive halving of the image size until 
the minimum image dimension N or M is smaller than 512. Halving the size of an image 
may be done through the use of a spatial low-pass filter and down-sampling, as is known 
in the art. Halving the size of the image increases the speed of processing by reducing 
data sizes, and may, in certain circumstances, improve the quality of the measurements. 

The processor 205 then pads in sub-step 1315 the resized image produced in sub- 
step 1310 to twice its size by inserting zeros around the boundary of the resized image. 
The zero-padded image formed by sub-step 1315 is then Fourier transformed in sub-step 
1320 through the use of a Fast Fourier Transform (FFT). In sub-step 1325, the result of 
this Fourier transform is then multiplied by a complex spiral of the form: 



_ u + iv 



yju 2 +V 2 



(34) 
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where (u,v) are Cartesian frequency coordinates in the Fourier domain with 
their origin at the DC offset at I2\\_M /2j). The result of the multiplication in sub- 
step 1325 is then multiplied by a further complex spiral in sub-step 1330, and the result of 
this multiplication is then inverse Fourier transformed through an inverse FFT in sub-step 
1335. The result of sub-step 1335 is then multiplied with the zero-padded image that was 

the result of sub-step 1315. 

The processor 205 also applies an inverse FFT to the result of sub-step 1325. 

The result of sub-step 1345 is then squared in sub-step 1350. 

Next the result of sub-step 1340 is subtracted in sub-step 1355 from the result of 
sub-step 1350 to form the complex image 1360. As set out above and also referring to 
Fig. 15, if step 1205 is applied to the test pattern image 1005, then complex image 1206 
results, whereas if step 1205 is applied to image 1010, then complex image 1207. 

With sub-step 1205 described in detail and referring again to Fig. 15, step 1120 
continues in sub-step 1210 where the processor 205 generates from the complex images 
1206 and 1207 images 1211 and 1212 that are substantially invariant to translations in the 

images 1005 and 1010. 

Fig. 17 shows a more detailed flow diagram of sub-step 1210, which operates on 
complex image 1360, which may be any one of complex images 1206 or 1207 formed by 
the preceding sub-step 1205. Step 1210 starts in sub-step 1410 where the processor 205 
applies an FFT to the complex image 1360 received as input. The result of the FFT 
applied in sub-step 1410 is an image with complex pixel values, and is converted in sub- 
step 1420 to an image having real pixel values only by taking the magnitude of each 
complex value. The processor 205 then applies, in sub-step 1430, an inverse FFT to the 
real image resulting from sub-step 1420 to produce a further complex image, which in 
turn is converted in sub-step 1440 to a further real image 1450 by adding the real and 
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imaginary parts of each pixel. If the complex image 1206 was received as input to step 
1210 then the real image 1450 produced is image 1211 (Fig. 15), whereas in the case 
where the complex image 1207 was received as input to step 1210 then the real image 
1450 produced is image 1212 (Fig. 15) 

5 Referring again to Fig. 15, after performing sub-step 1210, sub-step 1120 

continues in sub-step 1215 where the images 1211 and 1212 are resampled into a log- 
polar space to form images 1216 and 1217. Resampling into the log-polar space performs 
a mapping from a point having coordinates (x,y) in the Cartesian coordinate space to a 
point with coordinates (r,0), where coordinate r corresponds to the log of the distance of 

10 Cartesian coordinates (x,y) to the origin of the Cartesian coordinate space, and 9 
corresponds to the angle that the point (x,y) makes with the origin. Hence: 



log(V* 2 +y 2 ) • and < 35 > 



0oc tan"'( — 



(36) 



If an image having width W and height H is resampled to the log-polar domain or 



15 space, then the coordinate r would range between log(vV 2 + H 2 12) at the high end, 
and negative infinity at the low end. This is clearly not possible to realize, and it is 
necessary to clip the coordinate r at some small value of r. This clipping has the effect of 
excluding a disk of pixels from the centre of the image. 

In the preferred implementation a range of log r is chosen such that a disk of 
20 radius approximately 5% of the width of the image is excluded. Such a choice gives a 
suitable trade-off between losing image information, and producing a log-polar 
ampling which contains an adequate contribution of values from all areas in the input 



res 



images. 
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Fig. 18 illustrates some characteristics of resampling an image 1510 having 
Cartesian coordinates to an image 1520 in the log-polar domain. Image 1510 is 
surrounded by a region 1540 which maps to zero values in image 1520. Disk 1530 
contains pixels excluded from the resampling. The distance to any point in the image 
5 1510, excluding disk 1530, and including the surrounding region 1540 from the origin 
ranges from 0.05 to 1. In the log-polar space that range resamples to the range -3 to 0. 

Feature 1570 in image 1510 resamples to feature 1570 in the log-polar image 
1520. It can be seen that approximately two-thirds of the Cartesian image (for radius r 
between 0.36 and 1.0) is mapped to approximately one-third of the log-polar image (for 

10 log r between -1 and 0). 

Because the log-polar resampling is highly non-linear, the log-polar resampled 
images 1216 and 1217 (Fig. 15) are preferably created to be larger than the original 
Cartesian images 1211 and 1212, thereby ensuring that pixel information, other than that 
from the central disk, is not lost. A log-polar image in which the log-radial axis is 

15 approximately double, and the angular axis is approximately six times the width of the 
input image, will result in a log-polar transformation in which little information is lost due 
to the resampling. 

Assume the width W of the input image is the larger of the two image 
dimensions W and H. The resampling to the log-polar domain of sub-step 1215 starts by 
20 forming an empty log-polar image with a radial axis of length X=2W and a preferable 
angular axis of length Y=6W. 

The processor 205 then calculates for each pixel with coordinate (x^oO in the 
log-polar image an angle and radius (r,0) in the input image as follows: 

r = aexp(log(/2 I Ia)x % lX % ) ' (37) 

671600.doc 



-48 - 



6 = tan 



*4X72j 



(38) 



where R { = Vw 2 +# 2 /2. 



(39) 



Parameter a controls the radius of the disk in the centre of the input image from 
which of pixels are excluded during the resampling. The preferred radius of 5% of the 
maximum radius, which is R u is achieved with parameter a = 0.05 . 

The log-radius log(r) and angle 9 (x,y) are then converted to Cartesian 

coordinates as follows: 



Next the value from the input image at position (x,y) is interpolated, using bi- 
cubic interpolation. If the position (x,y) falls outside the original image, then a value of 0 



is used. This interpolated value is the value attributed to coordinate (xV) in the log-polar 
image. 

Referring again to Fig. 15, the log-polar resampled images 1216 and 1217 
formed by sub-step 1215 are then correlated in sub-step 1220 by the processor 205, using 
phase correlation. During phase correlation the FFT of a first input image, which is 
image 1216 in this case, is multiplied by the complex conjugate of the FFT of a second 
input image, which is image 1217 in this case, and the result of this multiplication is 
normalised to have unit magnitude. An inverse FFT is then applied to the result of the 
normalisation. 

The result of the correlation in sub-step 1220 is an image containing a magnitude 
peak, the location of which represents the scale and rotation relating the images 1005 and 



x = rcos6 +[X /2] 



(40) 



y = rsin0 + |//2_|. 



(41) 
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1010. Accordingly, peak detection is performed on the result of the correlation in sub- 
step 1225 in order to determine the location of the peak. Processor 205 uses the location 
of the peak in sub-step 1230 to determine rotation and scale parameters that relate the two 
1005 and 1010. Let the location of the peak be (p x , p y ) and let X and Y be the 



images 



5 width and height respectively of the log-polar resampled image 1216. Coordinate p y , 
with a range of [0...Y -1], represents an angle between (1 radians for p y = 0 and- 12 
radians for p = Y . Coordinate p x , with a range of [0...X -1], represents a scaling 
factor, with Px = 0 corresponding to a scaling factor of a, p x =|_X/2j corresponding to 
a scaling factor of 1. and p x = X (which does not appear in the image) corresponding to 

10 a scaling factor of 20. The preferred value of a is 0.05, though other values may be used. 

From peak location ( p x , p y ) the angle 0 and scaling factor s is derived as: 

0 =2n^Y /2j- Py )lY ( 42 > 

s = exp(log(afl_X /2]-p x )ix 1 2 J)) (43) 

Processor 205 then uses the rotation and scale parameters to rotate and scale the 
15 test pattern image 1005 in sub-step 1235, using bi-cubic interpolation. Images 1005 and 
1010 now have the same orientation and scale. The result of sub-step 1235 is then 
correlated with the imaged test pattern 1010 in sub-step 1240, again using phase 
correlation as described above. In sub-step 1245 the processor 205 performs peak 
detection on the result of the correlation in sub-step 1240. The location of the peak is 
20 used by the processor 205 in sub-step 1250 to determine the translation parameters 
< jcb , y 0 ) that relate images 1005 and 1010. Let (p x ,p y ) be the location of the peak detected 
in step 1245, then the translation parameters ( x 0 , y„) are: 
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x 0 =\_Nl2\- Px (44) 

y 0 =\_MI2\- Py («) 
where N and M are the width and height respectively of the test pattern image 



1010. 



5 Referring again to Fig. 14, sub-step 1130 where invariant pattern detection is 

performed is now described in more detail with reference to Fig. 19. As set out above, 
sub-step 1 130 is performed if the test pattern includes an alignment pattern. Sub-step 
1130 starts in sub-step 1710 where the luminance channel of image 1010 is resized by a 
process of successive halving until the resultant image is sized such that the smallest of 

10 the width and height is in the range 256 to 511 pixels. Sub-step 1710 is performed to 
improve the efficiency of the steps that follow. The successive halving may be performed 
by convolving the image with a low-pass filter and decimating the result of the 
convolution. The processor 205 then performs a two-dimensional FFT in sub-step 1720 

on the resulting resized image. 
15 Preferably, before computing the FFT in sub-step 1720, the image luminance 

values near the image edges are first attenuated so that the image luminance values fade 
to zero gradually and smoothly towards the edges of the rescaled image. The attenuation 
of luminance values at the edges of the rescaled image removes artefacts typically formed 
by the FFT. 

20 Step 1130 continues in sub-step 1730 where the processor 205 resamples the 

result from step 1720 into a quasi-polar frequency space. A complex image is produced 
wherein horizontal rows thereof correspond to radial slices in the two-dimensional FFT 
that resulted from sub-step 1720. The angular spacing and the radial scaling need not be 
constant. This may be achieved by a direct polar transform of the two-dimensional FFT 
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which resamples the FFT onto a polar grid using bi-cubic interpolation. Whilst simple, 
this method produces artefacts which can adversely affect detection of the embedded 
alignment pattern. The preferred method of performing sub-step 1730 is described below. 

The invariant pattern detection step 1130 continues in sub-step 1750 where a 
5 one-dimensional Fourier transform of the one-dimensional basis function is performed, 
with the basis function being the function from which the one-dimensional scale invariant 
patterns embedded into the test pattern image 1005 are formed, such as the basis function 
given by Equation (1). Alternatively, the basis function may be mathematically 
transformed. 

10 Next, the transform of the basis function resulting from sub-step 1750 is 

multiplied in a pixel by pixel fashion in sub-step 1760 with the complex conjugate of the 
values of the output of sub-step 1730 along horizontal rows for all angle values. Values 
along horizontal rows represent radial lines in the two-dimensional FFT. The complex 
pixel values resulting from sub-step 1760 are then normalized by the processor 205 so 

15 that the pixel values have unit magnitude. Sub-step 1770 follows where the one- 
dimensional EFFT is performed on the output of step 1760 along horizontal rows. The 
result of sub-step 1770 is a complex image with peaks in its magnitude at locations 
corresponding to the orientations and scales of the one-dimensional basis functions within 
the image 1010. The processor 205 in sub-step 1780 detects such peaks in the manner 

20 described in more detail below with reference to Fig. 25. 

Finally, in sub-step 1790 the locations of 4 of the peaks detected in sub-step 
1780 are used to determine the affine parameters that relate the images 1050 and 1010. In 
particular, the affine transformation described by linear transformation parameters 
(a n ,a 12 ,a 21 ,a 22 ,jc 0 ,y 0 ) that maps the original set of one-dimensional basis function 
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parameters, those being radius r t and angle a x , to radius s t and angle & , is determined 

from the 4 selected peaks. 

The processor 205 does so by first sorting the peaks detected in sub-step 1780 
into order of their magnitude. The peaks are also filtered such that peaks that are within 
5 10 pixels of a peak with a higher magnitude is removed from the list of peaks. The 
remaining top 64 peaks, if that many exist, are then further processed by selecting in turn 
each possible combination of 4 peaks and performing the following analysis, keeping 
track of which combination of 4 peaks best satisfies the conditions of this analysis. The 
radius s t and angle j3, of each peak are computed from the (x,y) offset of that peak in the 

10 quasi-polar map as follows: 

* 

The input image is of size (W,X' + Y') pixels. Let 
Y 2 =[Y/2] 

X 2 =lXJ2] (46) 
W 2 =\w/2] 

If y < Y\ then: 

y, = y-Y 2 

x s = x—W 2 

/3. =7T/2-tan-'^- (47) 
^Y 2 2 + y s 2 

15 else if y >= Y, 
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/3. -71- tan 




(48) 



/ 2 2 



As mentioned with reference to Equation (4), the set of preferred parameters n 
and cti has been specially chosen so that the axes of symmetry of the one-dimensional 
basis functions they represent intersect at points that define line segments that have 

5 certain ratios of lengths. As the ratios of lengths are invariant under affine 
transformations, the first condition which the combination of 4 peaks must satisfy is that 
line segments generated therefrom should have ratios of lengths that correspond to those 
of the axes of symmetry of the patterns embedded. In the case where the ratios of lengths 
do not correspond to those of the axes of symmetry of the patterns embedded the 

10 combination of peaks cannot correspond to the four original basis patterns modified by an 
affine transform and this combination is discarded. 



describe the axis of symmetry of one of the one-dimensional scale invariant patterns 
embedded in the test pattern. Rather than determine the affine transform applied to the 
15 test pattern image 1005 through the changes in these line parameters directly, the affine 
transform is determined from the intersection points of the 4 axes of symmetries specified 
by the 4 selected peaks. The intersection of two axes of symmetry represented by 
parameters {s k ,P k } and {s m ,j3 m } is labelled {x^y^), and is given by the matrix 



As previously described, the radial and angular coordinates of a peak, s t and /3,. 



equation: 



20 




(49) 
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Clearly there is no intersection if the lines are parallel, and so the equivalent 
constraint sin(j3 fe - P m )^0 is imposed. In practical situations sin 2 (/J* -/3 m )>0.25 is 
sufficient to ensure good localization of the intersection point (x^ , ) . Now, the 
parametric equation of a line specifies the linear distance of any point on that line relative 
5 to the perpendicular bisector of that line that passes through the origin. In the current case 
of four mutually non-parallel lines, each line has three intersection points along its length 
and the ratio of the intersection intervals remains invariant to affine distortions. The 



distance /L_ , along the fc th line where the m m line intersects, is given by 



th 



(50) 



10 The above equation is then enumerated for all combinations for k*m and a 

table generated which contains the locations along lines: 



'13 



"21 
A31 



'41 



-32 



"42 "43 



A, 4 

^34 



(51) 



At this stage it is useful to order the parameters by size 
ft*J max > fob.!L > = 1 * of each line and find the length ratios R k ': 



15 



R k 1 = min 



fijfcm }max fa km }mid fakm }mid fakm }min 
. fakm Imid ~ fakm 3min fakm }max ~~ fakm }mid 



<1 



(52) 



This generates 4 ratios from the 4 axes of symmetry. There are also 4 ratios that 
may be generated from the original set of one-dimensional basis function parameters r ( 

and a . If we denote these ratios as R k then we define the error in the ratio measure for 



the selected set of 4 peaks as: 
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(53) 



If the error E ra , io is greater than 0.1 the set of peaks is discarded. Alternatively, if 
the error E Tatio is less than 0.1, then the processor 205 applies a linear least squares fitting 
model to determine the best fitting affine transform that maps the set of intersection 

5 points of the axes of symmetry generated by the 4 selected peaks back to the original set 
of intersection points of the axes of symmetry of the embedded pattern. 

The preferred method of performing sub-step 1730 (Fig. 19), that is resampling 
the result from step 1720 into a quasi-polar frequency space, is now described in more 
detail with reference to Fig. 20. Sub-step 1730 starts in sub-step 1810 where the 

10 processor 205 replicates the input image (the result from step 1720), having size QC',Y% 
into two copies. In sub-step 1820, the first copy is padded with zeros in the X direction to 
a width of W=2*MAX(X',r)> resulting in an image of size (WJ% The padding is 
performed so that column offset \X'I2\ in the first copy corresponds to column offset 

\W 1 2 J in the padded image. 
15 Also, in sub-step 1830, the second copy is padded with zeros in the Y direction to 

a height of W. The padding is performed so that row offset 2 J in the second copy 
corresponds to row offset \W/2] in the padded image. Sub-step 1830 is followed by 
sub-step 1840 where the padded image resulting from sub-step 1830 is rotated by 90 
degrees, resulting in an image of size (WJC). 



respectively are transformed by the processor 205 by computing the one-dimensional 
Fourier transform is of each row. 



20 



In sub-steps 1850 and 1860 the results from, sub-steps 1820 and 1840 
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This is followed by sub-steps 1870 and 1880 where the results from sub-steps 
1850 and 1860 respectively are transformed by the processor 205 by computing 
individual chirp-Z transforms on each of the columns. Each chirp-Z transform is 
performed to preserve the centre position of each column, at positions [Y'/2j and 
5 \X'1 2 J within the columns of the results from sub-steps 1870 and 1880. 

The scaling factors m z fov each column z in the results from sub-steps 1870 and 
1880 respectively are: 

m £ =Lw/2]/(z-Lw/2j) (54) 

Each scale factor m z is negative for z < |W72j, corresponding to a vertical flip. 

10 Where the scaling factor is undefined for z = \W/2], the central pixel position is 

replicated across the whole column. 

Assuming a square image, the results from sub-steps 1870 and 1880 represent 
quasi-polar transforms of the Fourier Transforms of the resized, windowed input image, 
with the result from sub-step 1870 having angles within the range [-• /!.• /4], and the 

15 result from sub-step 1880 having angles in the range [• /4.. > /4]. If the input image is 
rectangular, the angular ranges will be from [-atan2(r,X%atan2(r,X')] and [atan2(r^0 
„• ~atan2(y ,X')]. Because each row of the quasi-polar transform contains positive and 
negative radii, it has all angles within [0.,2» ] radians. 

Sub-step 1730 ends in sub-step 1890 where the processor 205 combines the 

20 images resulting from sub-steps 1870 and 1880 to form an output image of dimension 
\WX+X') by replicating the pixels of the image resulting from sub-step 1870 into the top 
part of the output image and replicating the pixels of the image resulting from sub-step 
1880 into the bottom part of the output image. 

4 

o 
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The peak detection performed in sub-steps 1225 (Fig. 15), 1245 (Fig. 15) and 
1780 (Fig. 19) is now described in more detail with reference to Fig. 25 where a flow 

» 

diagram of a method of performing peak detection on a correlation image 1610 is shown 
in more detail. Correlation image 1610 may be a real image or complex image. Step 
1780 starts in sub-step 1620 where the processor 205 forms a list of peaks in the 
correlation image 1610, that is all points in the correlation image 1610 where the 
magnitude of the point is larger than neighbouring points. In sub-step 1630 the points are 
sorted in order of the magnitude of the pixel value. 

However, peaks may occur in noisy regions, causing many peaks to be clustered 
close together. In the case of sub-step 1780, the manner in which the parameters of the 
embedded patterns were chosen establishes that there should be only one peak in any one 
region. Also, in the case of sub-steps 1225 and 1245 only one peak should exist in the 
correlation image 1610. Accordingly, only the largest peak within a certain radial 
threshold is considered, with a preferred radial threshold being 10 pixels. In sub-step 
1640 each peak in the sorted list is considered in decreasing order of its magnitude, and 
any peak in the list that is lower in the list (smaller magnitude) and within the radial 
threshold of the peak being considered is removed from the list. 

In sub-step 1650 that follows the processor 205 truncates the sorted list of peaks 
to a length equal to the number of peaks that are expected. In the case of sub-step 1780 
this is the number of alignment patterns embedded into the test pattern, which is 4 in the 
preferred implementation. In the case of sub-steps 1225 and 1245 there should be only 
one peak. The positions of these peaks are to be determined with high precision. In sub- 
step 1660 a next peak is selected. In sub-step 1670 the processor 205 inputs a 27 by 27 
region centred on the location of the peak being considered to. an FFT, followed by a 
chirp-z transform which zooms in on the peak by a factor of 27. The chirp-z transform 
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allows computation of the discrete Fourier transform (DFT or the inverse DFT) with 
arbitrary spacing. The method works by expressing the DFT as a discrete, cyclic 
convolution. Because such convolutions can be implemented using FFTs it is possible for 
the entire computation to take advantage of the FFT speed. By suitable choice of spacing, 
5 the chirp-z transform becomes an interpolation technique, so that, for example, a DFT is 
finely sampled (that is to say zoomed) over a selected region. 

The pixel in this 27 by 27 image with the highest magnitude is determined in 
sub-step 1680, and the sub-pixel location of this peak is determined using a biparabolic 

* 

fit. This sub-pixel accurate peak location is the output of the peak detection step. 
L0 In sub-step 1685 the processor 205 determines whether more peaks are to be 

processed. If more peaks exist, then step 1780 returns to sub-step 1660 from where the 
next peak is selected and processed. Alternatively step 1780 ends. 

The foregoing describes only some embodiments of the present invention, and 
modifications and/or changes can be made thereto without departing from the scope and 
15 spirit of the invention, the embodiments being illustrative and not restrictive. 

For example, in the implementation(s) described above the luminance channels 
of the test pattern image 1005 and imaged test pattern 1010 are used for registration. 
However, some other combination of channels may be used, such as the chrominance, or - 
each single channel separately, such as the red, green or blue channel. 
20 It is sometimes desirable to pad the test pattern images 1005 with a colour that is 

similar to that of the material on which the test chart 1 10, 150, or 170 has been fabricated 
or printed. For instance, if the test chart 110, 150, or 170 is printed on white paper, the 
test pattern images 1005 may be surround by an area of white pixels to allow registration . 
of the images right to the edge of the image. 
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Furthermore, the correlation performed between blocks in the block based 
correlation (step 1035) is described as single channel correlation. Two colour channels of 
the images 1030 and 1010 may be used as real and imaginary parts of a complex image 
and complex phase correlation is then performed between the (complex) blocks. 
5 Also in the implementation of the block based correlation step 1035, the blocks 

selected for correlation are overlapping and evenly spaced. Non-overlapping blocks, or 
variably spaced and sized blocks may be used. Yet a further variation is to use non- 
square blocks. 

* 

Yet another modification to the block based correlation step 1035 is to, before 
10 the blocks are correlated, decimate or scale the images 1030 and 1010 to reduce the 
amount of computation necessary to perform the correlation step. 

Multiple application of the block based correlation step 1035 may be performed, 
where the warped test pattern image 1055 replaces the coarsely registered test pattern 
image 1030 as the input to subsequent block based correlation steps. The distortion maps 
15 so formed are then combined through composition and bi-cubic interpolation. This 
allows the application of the block based correlation step at multiple block sizes and step 
sizes to produce even better registration results at the cost of additional computation. 

In the context of this specification, the word "comprising" means "including 
principally but not necessarily solely" or "having" or "including", and not "consisting 
20 only of. Variations of the word "comprising", such as "comprise" and "comprises" have 
correspondingly varied meanings. 
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The claims defining the invention are as follows: 

1. A method of analysing images, said method comprising the steps of: 

receiving first and second images, said second image being a distorted version of 
said first image; 
5 labelling pixels of said first image with pixel labels; 

determining distortion parameters for aligning said first image with said second 

image; 

warping at least said pixel labels using said distortion parameters; and 
associating said pixel labels with corresponding pixels in said second image, 
10 wherein said labels provide information on a state of pixels in said second image before 
distortion. 

2. Apparatus for analysing images, said apparatus comprising: 

means for receiving first and second images, said second image being a distorted 

15 version of said first image; 

means for labelling pixels of said first image with pixel labels; 
means for determining distortion parameters for aligning said first image with said 
second image; 

means for warping at least said pixel labels using said distortion parameters; and 
20 means for associating said pixel labels with corresponding pixels in said second 

image, wherein said labels provide information on a state of pixels in said second image 
before distortion. 
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3. A computer readable medium comprising a computer program for analysing 
images, said computer program when executed on a computing device performs the steps 
of: 

receiving first and second images, said second image being a distorted version of 

* 

5 said first image; 

labelling pixels of said first image with pixel labels; 

determining distortion parameters for aligning said first image with said second 
image; 

m 

warping at least said pixel labels using said distortion parameters; and 
10 associating said pixel labels with corresponding pixels in said second image, 

wherein said labels provide information on a state of pixels in said second image before 
distortion. 

4. A method of analysing images, said method being substantially as herein described 
15 with reference to the accompanying drawings. 

+ 

5. Apparatus for analysing images, said apparatus being substantially as herein 
described with reference to the accompanying drawings. 

20 DATED this 3 1th Day of March 2004 

« 

CANON KABUSHIKI KAISHA 

Patent Attorneys for the Applicant 
SPRUSON&FERGUSON 
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